CDS

Accession Number TCMCG075C03471
gbkey CDS
Protein Id XP_017983843.1
Location complement(join(32559197..32559388,32559481..32559797,32559920..32559995,32560092..32560175,32560280..32560966,32561066..32562405,32563031..32563061,32564051..32564143,32564401..32564466))
Gene LOC18613794
GeneID 18613794
Organism Theobroma cacao

Protein

Length 961aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018128354.1
Definition PREDICTED: uncharacterized protein LOC18613794 isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category O
Description protein folding
KEGG_TC -
KEGG_Module M00404        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko03009        [VIEW IN KEGG]
ko03110        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K03500        [VIEW IN KEGG]
ko:K09508        [VIEW IN KEGG]
ko:K09519        [VIEW IN KEGG]
ko:K14007        [VIEW IN KEGG]
ko:K19370        [VIEW IN KEGG]
EC 2.1.1.176        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04141        [VIEW IN KEGG]
map04141        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGCAGGGCGATGAAGCCAGACTCTTGCTAGGCTTCCCCCCTAATTCTCGCCCTACTCCTTCTCAGGTAAAAGCAGCTTATAGAAAGAAAGTATGGGAGTCGCATCCTGACTTGTTTCCTGTTCACGAAAAACCTAAGGCGGAGTCTAAGTTCAAGTTGATTTTTGAAGCTTATACTTGCCTACAGTCTGAGATGGCCCCACTTCGGTCCACAGGATATGTTGACCCTGGATGGGAGCATGGGATTGCTCAAGATGAAAGGAAAAAGAAGGTTAAATGCAACTACTGTGGGAAAATAGTCAGTGGTGGAATATTCAGATTGAAGCAACATTTAGCCAGATTGTCTGGAGAAGTTACTCACTGTGAAAAGGTTCCTGAAGAAGTATGCTTGAATATGAGAAAGAACCTTGAAGGATGCCGTTCTGGTCGAAAACGAAGGCAATCAGAATATGAACAGGCTGCTCTAAATTTCCAATCTAATGAGTACAATGATGCAGAAGAAGCATCGGCAGGTTATAAACACAAAGGCAAGAAAGTGATGGGTGACAAGAACTTGGTCATCAAGTTTACCCCTCTTCGATCATTAGGATATGTGGACCCGGGATGGGAACATTGCGTTGCTCAAGATGAGAAGAAGAAAAGAGTAAAATGCAACTATTGCGAAAAAATAATAAGTGGGGGCATAAATCGGTTTAAGCAACATCTTGCTAGGATCCCTGGAGAAGTTGCATATTGTGAAAAGGCACCTGAGGAGGTATATCTCAAAATCAAAGAAAATATGAAATGGCACCGTACTGGCAGAAGGCATCGAAAACCTGATACCAAGGAGATATCTGCTTTCTACTTGCACTCAGATAATGAGGATGAAGGTGGAGAGGAGGATGGGTATTTGCAATGTATAAGTAAGGACATACTGGCTATTGACGATAAAGTTTCTGATAGTGACATTAGAAATAATAATGTCAGAGGTAGATCTCCTGGTAGTAGTGGTAATGGTGCTGAACCACTACTTAAAAGATCAAGACTGGATTCGGTATTTTTAAAGTCGCTGAAAAGCCAGACATCAGCACACTACAAACAAACAAGAGCAAAAATAGGTTTCGAGAAGAAAACTCGCAGGGAAGTGATATCTGCTATATGCAAATTCTTTTATCATGCAGGAATCCCTTCTAATGCAGCAAACTCTCCGTACTTCCATAAAATGCTGGAAGTGGTTGGTCAGTATGGGCAGGGTTTGCAAGGTCCTTCAAGTCGAATCATATCTGGTCGTCTCCTTCAGGAAGAGATTGCTAATATTAAAGAGTATCTGGCGGAGTTTAAGGCATCTTGGGCTATTACTGGTTGTTCTGTCATGGCTGACAGTTGGAATGATGCACAAGGAAGGACCCTGATTAACTTTTTGGTCTCTTGTCCTCGCGGTGTTTGTTTTCTCTCTTCTGTTGATGCAACTGATATGATAGAAGATGCTGCTAATCTCTTCAAGTTGTTAGACAAAGCAGTGGATGAGGTTGGCGAGGAATATGTAGTCCAGGTAATCACTAGGAACACTTTGAGTTTCAGGAATGCTGGAAAGATGCTTGAAGAGAAAAGGAGAAATTTATTTTGGACACCATGTGCTGTCTATTGCATTGATAGAATGCTTGAGGATTTTTTGAATATAAAATGGGTGGGAGAATGCATAGATAAAGCAAAAAAGGTGACAAGGTTTATTTATAACAATACCTGGTTGTTGAATTTTATGAAGAAAGAATTTACGAAGGGACAGGAACTTCTTAAGCCAGCTGTCACCAAGTTTGGCACTAATTTTTTCACTTTACAGAGTATGTTGGACCAGAGGGTTGGTCTTAAGAAAATGTTCCAATCAAATCGATGGCTTTCCTCCCGCTTTTCCAAATTAGATGAAGGTAAAGAGGTTGAAAAAATTGTCTTAAATGTCACCTTTTGGAAGAAGATGCAGTATGTGAAGAAATCCTTAGAGCCAGTTGCTGAAGTTCTTCAAAAGATAGGTAGTGATGAAATCCGATCAATGCCATTTATCTATAATGACATATGTAGAACAAAGCTTGCAATTAAAGCCATTCATGGTGATGATGTGCGCAAATTTGGACCTTTCTGGAGTGTGATTGAAAACAATTGGAGTTCATTGTTCCATCATCCTCTTTATGTTGCTGCATACTTTCTCAATCCATCCTTCCGTTACTGCCCAGATTTTCTGATGAATCCTGAAGTAATTCGTGGTCTAAATGAGTGTATTGTTCGATTGGAGTCAGACAATGGGAAAAGGATTTCTGCATCCATGCAGATACCTGATTTTGTGTCGGCAAAAGCTGATTTTGGAACTGATTTGGCCATAAGTACTAGAAGTGAGCTTGATCCAGCTTCATGGTGGCAACAACATGGGATAAGTTGCTTAGAGCTGCAACGAATCGCCATACGCATACTAAGCCAGAGATGTTCATCGATTGGATGTCAGCATACCTGGAGTGTGTTTGATCAAGTTCACAGCAAAAGACGCAACTGTTTGTCTCGGAAGAGATTGAATGACCACACCTATGTTCATTACAACTTGCGACTGAGAGAACGCCAACTAGGAAGGAAGCCTGATGATTTGGTTTCCTTTGACAGTGCCATGTTAGAAAGTGTATTAGATGACTGGCTTGTGGAGTCAGAGAAGCAAGCCATGCAAGAAGATGAGGAGATTATTTATAATGAGGTGGAACAATTTTATGGAGATGATATGGATGAACATGTGAGTGAAGAAAAGAGACCTACAGAAATGGTCACGTTAGCTAGTTTGGTTGAACCATTGGATGTTAATCCTGCTGCTGGAGGTGTTACCACTGATGATGATGGTCTCGATTTTCTTGATGATGATTTGACGGATTAG
Protein:  
MQGDEARLLLGFPPNSRPTPSQVKAAYRKKVWESHPDLFPVHEKPKAESKFKLIFEAYTCLQSEMAPLRSTGYVDPGWEHGIAQDERKKKVKCNYCGKIVSGGIFRLKQHLARLSGEVTHCEKVPEEVCLNMRKNLEGCRSGRKRRQSEYEQAALNFQSNEYNDAEEASAGYKHKGKKVMGDKNLVIKFTPLRSLGYVDPGWEHCVAQDEKKKRVKCNYCEKIISGGINRFKQHLARIPGEVAYCEKAPEEVYLKIKENMKWHRTGRRHRKPDTKEISAFYLHSDNEDEGGEEDGYLQCISKDILAIDDKVSDSDIRNNNVRGRSPGSSGNGAEPLLKRSRLDSVFLKSLKSQTSAHYKQTRAKIGFEKKTRREVISAICKFFYHAGIPSNAANSPYFHKMLEVVGQYGQGLQGPSSRIISGRLLQEEIANIKEYLAEFKASWAITGCSVMADSWNDAQGRTLINFLVSCPRGVCFLSSVDATDMIEDAANLFKLLDKAVDEVGEEYVVQVITRNTLSFRNAGKMLEEKRRNLFWTPCAVYCIDRMLEDFLNIKWVGECIDKAKKVTRFIYNNTWLLNFMKKEFTKGQELLKPAVTKFGTNFFTLQSMLDQRVGLKKMFQSNRWLSSRFSKLDEGKEVEKIVLNVTFWKKMQYVKKSLEPVAEVLQKIGSDEIRSMPFIYNDICRTKLAIKAIHGDDVRKFGPFWSVIENNWSSLFHHPLYVAAYFLNPSFRYCPDFLMNPEVIRGLNECIVRLESDNGKRISASMQIPDFVSAKADFGTDLAISTRSELDPASWWQQHGISCLELQRIAIRILSQRCSSIGCQHTWSVFDQVHSKRRNCLSRKRLNDHTYVHYNLRLRERQLGRKPDDLVSFDSAMLESVLDDWLVESEKQAMQEDEEIIYNEVEQFYGDDMDEHVSEEKRPTEMVTLASLVEPLDVNPAAGGVTTDDDGLDFLDDDLTD